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Abstract 

Stanley Milgram's small world experiment presents "six degrees of separation" of 
our world. One phenomenon of the experiment still puzzling us is that how individuals 
operating with the social network information with their characteristics can be very 
adept at finding the short chains. The previous works on this issue focus whether on 
the methods of navigation in a given network structure, or on the effects of additional 
information to the searching process. In this paper, we emphasize that the growth 
and shape of network architecture is tightly related to the individuals' attributes. We 
introduce a method to reconstruct nodes' intimacy degree based on local interaction. 
Then we provide an intimacy based approach for orientation in networks. We find 
that the basic reason of efficient search in social networks is that the degree of "inti- 
macy" of each pair of nodes decays with the length of their shortest path exponentially. 
Meanwhile, the model can explain the hubs limitation which was observed in real-world 
experiment. 

China has a famous shortest poem titled life: net (by poet Bei Dao). This one word 
poem tells us that any one in the world is living in the invisible social networks. In the 
1960s, Stanley Milgram has revealed a striking feature of these networks by his famous 
experiment [1]. He showed us that any two people can be linked by a short chain of friends. 
The average length of the chains is about six, which is quite remarkably close to Karinthys 
prediction 40 years earlier |2J. This fascinating result has been popularized in the 1990s by 
John Guare's successful play "six degrees of separation" [3j and has been known as small 
world phenomena. More recent empirical studies using the Internet have demonstrate the 
similar conclusion [H [5] . 

Actually, Milgram's experiment showed us two issues of special interest: first, is the 
existence of short paths, and second, is the ability of people at finding them efficiently. The 
first issue has been extensively studied, especially with the Watts-Strogatz Small- World 
Network (SW) model (see Ref. [SI [7] and references therein for review, such as [HIS])- Here 
we focus only on the second issue that we still lack an equally complete understanding. 
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Kleinberg first noticed the searching problem and presented excellent models with perfect 
mathematical analysis |10[ [TT] . He modeled the social network as a lattice based network 
with some long-range connections. Under the condition that every node knows the lattice 
coordinates of his immediate neighbors (acquaintances) and the target, the letter delivering 
process was: each letter holder (node) forwards the letter across a connection that brings 
it as close as possible to the target in lattice distance. Kleinberg found that the network is 
searchable when the long-range connections obey a special distribution. Moreover, he has 
given some results on hierarchical network models |12j . Watts et al presented a hierarchical 
network based model which is more approximate to the real-world social network and got 
some interesting results by numerical simulation |13| . There also exist some other models try 
to present a framework of the second issue such as combining random walks and targeting 
searches at nodes with high degree [14]. Greedy routing and its modification have been 
studied extensively, by computer and social scientists [l5l [TH [171 UHl US [20] . 

The above studies on network navigation have indeed captured some basic features of 
Milgram's experiment, such as the letter delivering process is mainly based on geographic 
proximity and similarity of profession. But they all need some special network structures. 
So can we develop an approach to navigate networks efRciently without any requirement for 
network structure? 

From the original Milgran's experiment to modern empirical studies [T], we know that 
the letter holder chose his next recipient based on their location, profession, education, 
and other interests. It seems that without the information in addition to the network 
structure, it is impossible to search networks efficiently. So some researchers have suggested 
to overlap another network that describe the relationships of individuals' attributes such 
as location or profession to social network, so that the social network is searchable when 
we combine the information of these two networks ^21j . But we argue that the alleged 
additional information to networks is indeed tightly related to social networks. Obviously, 
the probability of acquaintance is actually related to the proximity between individuals' 
attributes. The structural properties of social network should be shaped by these factors. 
The formation and evolution of social networks are affected or even determined by the 
individual characters. That is why the individuals' attributes could give us some information 
about network structures and the social networks can be searched efficiently based on these 
factors. Recently, just when we prepared this manuscript, Boguria at al have also indicated 
that social distances among individuals have a role in shaping the network architecture and 
that, at the same time, these distances can be used to navigate the network. They discussed 
the effects of hidden metric space to the node similarity and navigability of networks [52] . 

From above arguments, we know that the individuals' attitudes and network structures 
are correlated each other tightly. The individuals' attitudes affect the network evolution 
and thus they should embedded in the network topology. Then, the problem becomes: can 
we recover the embedded information about the nodes in networks and use it to realize the 
effective searching? 

In this article, we introduce a method to get nodes' intimacy through local interactions. 
Then we present an approach of orientation in network based on the "intimacy degree" . We 
assign an n-dimensional vector to each node to describe its attribute. This vector could 
have the information of other nodes in the network through a series of acquaintance. It 
could be abstracted from the network structure and can measure the "intimacy degree" of a 
node with any other nodes. A pair of nodes will be more intimacy if they are near and less 
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intimacy if they are far away in the network. The process of dehvering letter in the network 
is that the current letter holder always forwards the letter to the candidate who has the 
largest intimacy with the target. It has been demonstrated in the following discussion that 
our approach is very efficient and can be used in many network searching problems. 

1 Intimacy degree 

Now we will reconstruct the individual's attributes related with the network structure. Based 
on the assumption that social network topology contains the individuals' attributes that 
affect the network evolution. Here, an individual's attributes on the network could described 
as its intimacy degree with other nodes. The intimacy can be regarded as the integration of 
the similarities of the occupations, hobbies, locations, or nationalities etc. Suppose there is 
a connected network with n nodes. We assign an n-dimensional vector to each node i. Its 
jth element Vi{j) denotes the degree of intimacy of node i to node j. Ifi — j, we set Vi{j) — a 
(a > 1), which indicates the intimacy degree of each node to itself is a constant a always. 
In the initial, for each i and j, we set Vi{j) — a when i — j, otherwise, Vi{j) — 0. Each 
time we update the intimacy vectors of every node by local interaction parallelly. Suppose 
node i has k neighbors (in this paper, we say node /i is a neighbor of node i always means 
that there is an edge form node i to node h, if the network is a directed network), which are 
Nl ,Nf ■ ■ ^Nf . v^i, v^2, • • • ,v^fc are A: intimacy vectors of the fc neighbors. Then can 
be updated through the interaction with its neighbors according to the following four steps: 

k 

(l)v, = ^v^.; {2)v,{i)^0- (3)v, = ^ ; (4)^;,(^) = a; (1) 

In a word, we always keep Vi{i) = a, and the sum of all the other elements of is 1. The 
intimacy vectors will be fixed within O(lnn) steps of evolution (FiglT]). So by a proper 
steps of iteration, the intimacy vector of every node can be given. The element (j ^ i) 

denotes the comparative intimacy of node i to node j. Our crucial findings are that the degree 
of intimacy of each pair of nodes decays with the length of their shortest path exponentially 
in statistical sense (Sup. Theorem 1 and Fig[21). It indicates that the degree of intimacy of 
each pair nodes is dominated by the shortest path between them in statistical sense. Does 
it contravene to common sense? Suppose people only know ^ of his neighbor's information, 
where w > 1. Then one will know only of information about his neighbor's neighbor. In 
this way we can easy conclude that degree of intimacy of each pair of nodes (here we regard 
the amount of information one knows about the other as intimacy degree) decays with the 
length of their shortest path exponentially. 

2 Orientation 

Obviously, in Milgram's experiment, current message holder always try to forward the mes- 
sage to a immediate neighbor who seems can send the message to the target most quickly. 
How to chose the suitable neighbor? Suppose every one has only the local information , that 
means each node knows and only knows his neighbor's intimacy vectors, We may think that 
people will always send the message to the neighbor who has the largest intimacy degree 
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Figure 1: Convergence speed, d signifies the average value of all the absolute differences of 
the intimacy degree between T and T — 1 steps. The numerical experiments are done in 
WS (each node link it's two nearest neighbors and one random long rang connection), K 
(Kleinberg one dimensional small world networks, each node link it's two nearest neighbors 
and one long rang connection with clustering exponent a = 1), and BA (scale- free network 
model with average degree 3) networks respectively. All the network size is n — 1000. From 
the plot we can see that the d drop with T exponentially. 




Shortest path length Shortest path length 



Figure 2: Average Intimacy degrees decline exponentially with the shortest path length. The 
simulations are done in the same networks as in Fig.l. (A)a = 100, the intimacy vectors 
evolved 100 times. (B) a = 1, the intimacy vectors evolved 10 times. Each point denotes 
the average intimacy degree and the corresponding length of the shortest path. From the 
two plots, we can safely conclude that the intimacy degree of each pair of nodes decays 
with the length of their shortest path exponentially. The slope approximate —InH, where 
H = a{k + 1) — 1 theoretically (See Supplementary Theorem 1). In plot (a), H = 302 and 
in plot (6), H = 5. They are consistent well with simulations. 
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Figure 3: Orientation ability. The numerical experiments is done in the in the same networks 
as in Fig.l. The horizontal axis D means the evolving time for intimacy degree is I? x AL 
{AL is the average shortest path length). Accuracy is defined as the value of jj-, where 
Ls, L denote the total length of the pathes of successful searches and the corresponding 
total length of shortest path length respectively. Success is defined as the where 5"^, S 
denote the total number of success searches and the corresponding total number of searches 
respectively. If within the searching steps of 2 x AL in each network, the message has 
not reached the target, the searching is defined as a failure search. From the plot we can 
conclude that when the evolved time is comparative to the average length of the shortest 
path. Orientation will work very well. 



with the target. So the orientation is the process that the current message holder i sends 
the message to its neighbor h, which has the most intimacy degree with the target t. And 
then node h will send the message in the same way until it reaches the target t. We can 
strictly prove that the degree of intimacy will decline inversely to the degree of the node 
by which intimacy passes (Sup. Theorem 1). It implies that "highly connected individuals 
(hubs) appears to have limited relevance to the kind of social search" which was observed 
in real- world experiment [4]. Moreover we also can prove that for a connected network, 
the chosen neighbor h is more intimate with node t compared to node i after the sufficient 
evolution steps (Sup. Theorem 2). This means that the message will not pass a node twice 
in the one sending process and always can reach the target t. 

In order to demonstrate our model works well in finding short chains, we define accuracy 
and success to evaluate the performance of our algorithm. Here accuracy Lg/ Lsearch means 
the consistence when the paths searched from algorithm is compared with the shortest paths, 
and success is the rate of success searches to reach the target in a given time steps, where, 
Lg is the sum of total shortest path lengths and Lg^arch is the corresponding sum of total 
searched path lengths. We do the numerical experiments on following three types artificial 
network : WS network which was presented by Watts and Strogatz [6] , K network which was 
presented by Kleinberg [10] and BA network which was presented by Barabasi and Albert 
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[S]. All artificial networks have 1000 nodes and the average degree is 3. Specially, the K 
network is based on 1-dimensional lattice, each node connects the two nearest neighbors 
and has one long-rang connection with clustering exponent a — 1. The numerical results 
indicate our algorithm works well (as shown in FigjS]). 

The time complexity for the evolution of intimacy vectors is 0{log{n)ii?). When intimacy 
vectors are established, the expectation of the time complexity for search is only 0{log{n)) 
in small world networks. 

3 Summary 

In summary, the intimacy based orientation can well explain the small world phenomena. It 
shows that the individuals can search the short path in social networks with only the local 
information. The basic reason is that the intimacy degree of the individuals will decay with 
the length of the shortest path exponentially. Moreover, it also can explain why successful 
social search is conducted primarily through intermediate to weak strength ties, does not 
require highly connected "hubs" to succeed. This phenomenon was observed in real-world 
experiment and cannot be explained by the previous researches. For application, the 
space complexity (for each node, we need an n-dimensional vector) of Orientation is higher 
than the previous algorithm such as Navigation [11]. But it is a decentralized algorithm 
naturally and the space complicity is not a challenging problem in decentralized computing 
system. We can also use community structures in networks to reduce the space complexity. 
Our orientation method has potential applications in P2P search system, traffic navigation 
system, Internet routing and so on in the future. 
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4 Supplementary 



4.1 Theorems and proof 

In a network, if there exist a path form node pm to node po- Pm — > Pm-i — *■ ■ • • POi 
then there must be a corresponding intimacy spreading path po ~* Pi ~* Pm which 

we call intimacy spreading path form po to p,„. That's to say we send message alone 
Pm Pm-i ■ ■ ■ — > POi and get nodes information alone po — > Pi ^ ■ ■ • — > Pm- It is 
obvious that there exist a intimacy spreading path from pq to Pm, there must be a path 
from p„ to po- 

Theorem 1: Suppose a >> 1 and there exist a constant H such that for any positive 
m, YiiLii^iO' + fci — 1) « i7™ in statistic sense, where ki is the out degree of node i (it is 
strict in lattice based networks). Then intimacy degree of p,„ possessed about po will decay 
with the length of their shortest path exponentially, and will decay inversely to the degree 
of the node by which intimacy passes. 

Proof: 

Suppose Pi has ki neighbors, N^^ , Np^ , • • ■ , iV^ ^ denote all the neighbors of pi and po 
is a neighbor of pi, v'^{y) denotes the intimacy degree of node x to node y after T steps 
evolution, v^{y) = a if y—x, otherwise 0. Without losing generality we let po = ^pi- 

■.■ Po — > pi — > ■ • • — > Pm is an intimacy spreading path from po to Pm- 

_pi _pi 



PI PI PI 



.'. from the EqU] we have ijj^ (po 

•.• < vl-\po) <l,i^l,---,ki 

PI 

■: a > 1 

•■• v^ipo)^ ^,T^l,2--- + oo 
•■• ^IAP'0~j-,T^1,2--- + ^ 

Case 1 (see Fig. Incase 1): suppose there exist only one shortest path (p„i Pm-i 
■ ■ ■ — > Po) from Pm to Po, then when the intimacy of po spread to pi currently alone po 
pi • • • -> p,„. 

Then the first intimacy degree (It means the intimacy degree possessed in the first time 
Of one pair of nodes) ^;p.(po) = fc,,^,,,,^;;.^^ « k,a+k.-k,^ = ^^ere, 

iVp. , • • ■ , are ki neighbors of pi and N^, — pi_i. (The above equation also implies the 
intimacy degree will decay inversely (^r^) to the degree of the node by which intimacy 
passes.) 

In this way we have: Vp^ (po) = a]!" i fc,a+fc.-i 
•.• l\T=lik^a + h - I) - H"' 
Then we have Wp„(po) ~ aH^^ 
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Po O » >G se >C Pp, (Case 1) 




(Case 2) 



(Case 3) 



Figure 4: The plot represents three cases in the proof. Case 1 denotes there are only one 
intimacy spreading shortest path from po to p„i- Case 2 shows there are two dependent 
shortest pathes and case 3 denotes 3 independent shortest pathes in which two pathes 
encounter at node p. 



Case 2 (see Fig. |3]case 2): When there are r independent (means there is no common 
node for each pair of shortest pathes) shortest intimacy spreading pathes from pq to pm 
through r neighbors of pm- Assume the length of the shortest intimacy spreading pathes 
from Pq to pm is m,then we can easily get that 

Case 3 (see Fig. 2] case 3): When there are r shortest intimacy spreading pathes from 
Po to Pm in which (3 shortest intimacy spreading pathes are dependent and other r — (3 are 
independent. All the situations are equal to this situation: all of the /3 dependent intimacy 
spreading pathes encounter at node p and they are independent from po to p and there only 
one shortest intimacy spreading path for p to Pm- Assume the length of shortest intimacy 
spreading pathes from p^ to pm is m, from pq to p is mi and p to Pm is TO2 ^ m — mi. 

According to case 1 and case 2 we have 

Vp„SPa) ~{r- /3)aH-"' + {taH~'''^)H-"'^ = raff"" 

Now, the task we face is to prove wj^ (po) w raH^™ for any T >= m. 

Obviously, wj^^(po) equal the sum of the each step (m to T) first intimacy degree, then 
we have ; 

raH-"' = vp„^ (po) < (po) < raH-"' + a[f^ + • • • , +^] where K is the 

maximum out degree of all nodes. 



Thus, v^Jpo) < raH-"' + + (f < rai?-™ + 2a 



.'. for sufficient large a, wj^(po) ~ raH~™. 

Theorem 2: For a connected network and a > 1. If « is not the target t then there must 
be at least one neighbor q of node i, q is more intimate with target t compared to node i 
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after sufficient long time evolution. This means that the message will not pass a node twice 
in the one sending process and always can reach the target t. 

Proof: 

■.■ the network is connected 
.•. node i at lest has a neighbor. 

Assume i has fc, neighbors which are N--, Nf, • • • , N^*. 

••■ -"°°(^') = \..+..-E^L.<TW ''''' + - <T^^ > 
.-. v+^U) < max{v;+f (j), • • • 

Therefore the above q must be existed and in the a sending process, the current message 
holder will always more intimacy with target t than the all previous message holder which 
implies that the message will not pass a node twice. Because the number of node of a 
network is finite, the message will and can reach the target t. 
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